



Vol. 7, 1921 GENETICS: J. A. HARRIS ET AL. 213

the place of minima. The largest displacement occurred at about 5 mm. 
from the plate. 

Pressures in Smooth Straight Pipes. — These surveys in pitch and depth 
were completed for a number of telephone blown brass pipes, 1 cm. in 
diameter and of different lengths; but the very interesting graphs ob- 
tained must be omitted here. Thus the pipe 13 cm. long showed enormous 
maxima at a' and correspondingly large negative minima at d", at all 
depths (2, 4, 8, 12 cm.) below the mouth of the tube. Figure 11 reproduces 
the behavior of the same tube cut down to 10 cm. of length, at different 
depths 2, 4, 6, 8, 10 cm. below the mouth. Finally similar graphs for a 
variety of wide tubes and resonators have been worked out and progress 
made with the installation for symmetrical induction. 



THE PREDICTION OF ANNUAL EGG PRODUCTION FROM THE 
RECORDS OF LIMITED PERIODS 

By J. Arthur Harris, W. F. Kirkpatrick and A. F. Blakeslee

Station for Experimental Evolution, Carnegie Institution, and the Storrs
Agricultural Experimental Station

Communicated by C. B. Davenport, March 12, 1921 

For the past several years the writers have been considering the possi- 
bility of predicting the annual egg production of the domestic fowl from the 
records of short periods of time. Such records may be determined by trap- 
nesting, or by the use of other criteria when the maternity of the eggs is not 
required for breeding purposes. 1 

The first definite step in the direction of the use of the egg record of a 
short period for the prediction of the production during a subsequent or a 
longer period was, as far as we are aware, taken in 1917 when it was shown 2 
that in a heterogeneous series of birds such as are submitted by practical 
breeders in egg laying contests, the October egg production is correlated 
with that of every other month of the year. The investigation was carried 
much further in a second memoir 3 in which the correlations between the 
records of the individual months and the production of the whole year, 
between the records of the individual months and those of the remaining 
11 months of the year, and between the production of 5 of the individual 
months and the production of all the other individual months, were pub- 
lished for two series of birds. In this paper the equations for the prediction 
of total annual production from the record of the individual months were 
given. 

Our purpose here is to state briefly the results of a first test of the possi- 
bility of utilizing the linear regression equation (which is strictly valid 
only for the population from which it is deduced) for the prediction of the 
records of the birds of a flock the performance of which is unknown as far 
as the determination of the constants of the equations is concerned. 



214 GENETICS: J. A. HARRIS ET AL. Proc. N. A. S. 

In a population the straight line relating the egg production of a period,
$E_p$, with that of a period used as a basis of prediction, $e$, is

$$E_p = \left(\bar{E}_p - r_{eE_p}\,\frac{\sigma_{E_p}}{\sigma_e}\,\bar{e}\right) + r_{eE_p}\,\frac{\sigma_{E_p}}{\sigma_e}\,e$$



where the bars denote means, the sigmas represent standard deviations, 
and r indicates the correlation of the two variables in the standard popu- 
lation. 
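
The straight line just described can be sketched numerically. The following is a minimal illustration, assuming paired records (base-period eggs and period eggs) are available for each bird of the standard population; the function names and the sample figures are hypothetical and are not taken from the paper.

```python
from statistics import mean, pstdev


def fit_prediction_line(e, E):
    """Fit E = intercept + slope * e from paired records of a standard population.

    slope = r * sigma_E / sigma_e and intercept = mean(E) - slope * mean(e).
    """
    n = len(e)
    e_bar, E_bar = mean(e), mean(E)
    sigma_e, sigma_E = pstdev(e), pstdev(E)  # population standard deviations
    covariance = sum((x - e_bar) * (y - E_bar) for x, y in zip(e, E)) / n
    r = covariance / (sigma_e * sigma_E)  # product-moment correlation
    slope = r * sigma_E / sigma_e
    intercept = E_bar - slope * e_bar
    return intercept, slope


def predict(intercept, slope, e_new):
    """Predicted production for a bird laying e_new eggs in the base period."""
    return intercept + slope * e_new


# Hypothetical records, invented for illustration (not data from the paper).
base_month = [10, 12, 15, 18, 20]   # eggs laid in the base month
annual = [120, 130, 150, 165, 175]  # eggs laid in the whole year
intercept, slope = fit_prediction_line(base_month, annual)
print(round(predict(intercept, slope, 16), 1))
```

A line fitted in this way passes through the point of the two means, so a bird with exactly the mean base-period record is predicted to have the mean production.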

The value of $E_p$ given by the equation is the theoretical mean production
for the array of individuals of any class with respect to $e$. The assumption
to be tested is that we may write $E_p'$ and $e'$ instead of $E_p$ and $e$,
where $E_p'$ is the theoretical mean production for a period $p$ of the array
of birds of any grade of production $e'$ in the period used as a basis of
prediction in a series of birds which are not involved in the data upon which
the equations were based, but to which the equations are to be applied for
practical purposes.

The essential practical requisites for such prediction equations are: (1) 
That the errors of prediction shall be distributed about the true numbers 
in such a manner that estimation will not in the long run be either too 
high or too low. (2) That the magnitude of the deviation of the predicted 
from the observed egg production shall be as small as possible. 

Let $E_p''$ be the actual and $E_p'$ the predicted egg production of an
individual bird for any period, $p$, in a flock to which the equation is being
applied. The error of prediction is then $E_p' - E_p''$. The average of these
errors, with regard to sign,

$$\frac{\Sigma\,(E_p' - E_p'')}{N}$$

furnishes a measure of the success with which the first requirement, (1)
above, is met. The average of these errors without regard to sign furnishes
a measure of the average error above or below the true production
of the individual birds of a flock. The square root of mean square deviation

$$\left[\frac{\Sigma\,(E_p' - E_p'')^2}{N}\right]^{1/2}$$

furnishes a measure of this error which weights larger errors more heavily.

The errors may be expressed in actual numbers of eggs, or, in relative 
terms, as percentages of the mean production of the period and flock for 
which prediction is made. Both methods have been used in testing the 
equations. 
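
The three criteria, in both absolute and relative form, can be sketched as follows. This is an illustration only: the function name, the dictionary keys and the sample figures are hypothetical, and the relative values are taken as percentages of the mean actual production of the flock for the period predicted.

```python
from math import sqrt


def prediction_errors(predicted, actual):
    """Return the three error measures in eggs and as percentages of the flock mean."""
    errors = [p - a for p, a in zip(predicted, actual)]  # predicted minus actual
    n = len(errors)
    flock_mean = sum(actual) / n
    measures = {
        "with_sign": sum(errors) / n,                      # bias of the estimate
        "without_sign": sum(abs(d) for d in errors) / n,   # average absolute error
        "root_mean_square": sqrt(sum(d * d for d in errors) / n),
    }
    percentages = {k: 100.0 * v / flock_mean for k, v in measures.items()}
    return measures, percentages


# Hypothetical figures, invented for illustration (not data from the paper).
predicted = [150, 160, 170, 140]
actual = [145, 170, 165, 150]
measures, percentages = prediction_errors(predicted, actual)
print(measures)
print(percentages)
```

A signed average near zero indicates that the equation errs neither high nor low in the long run, while the other two measures describe how far the prediction for a single bird is likely to be from its true record.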

In testing the efficiency of such equations for purposes of prediction
we have proceeded in a purely objective manner. Working on the assumption
that the crucial test of any theory is its capacity for predicting
the unknown, we have calculated equations based upon the data of the
International Egg Laying Contest at Storrs, Conn., during the six contest
years, 1911-1917, inclusive. We have then used these equations to predict
the annual production (and the production of groups of months) for
the birds of the 1917-18 contest, using as a basis of prediction the individual
months of the laying year separately, pairs of successive months and
groups of three months. Our conclusions concerning the value of the
equations depend, therefore, not upon a priori considerations but upon the
results of actual tests of accuracy of prediction for series which were unknown
as far as the determination of the constants of the equations is
concerned.

Consider first of all the results of the attempts to predict the annual 
egg production of 415 White Leghorn birds observed at Storrs from Nov. 
1, 1917 to Oct. 31, 1918 from the records of a single month's production. 

The results of the three criteria of accuracy of prediction are summa- 
rized in table 1. 

TABLE 1

Errors of Prediction of Annual Egg Production from the Records
of Individual Months

MONTH USED    AVERAGE DEVIATION        AVERAGE DEVIATION         SQUARE ROOT OF MEAN
AS BASE OF    WITH REGARD TO SIGN      WITHOUT REGARD TO SIGN    SQUARE DEVIATION
PREDICTION    Actual     Percentage    Actual     Percentage     Actual     Percentage
              deviation  deviation     deviation  deviation      deviation  deviation

November      +2.39      1.52          29.59      18.78          38.65      24.52
December      -0.49      0.31          29.26      18.57          37.61      23.86
January       +2.58      1.64          30.09      19.09          38.77      24.60
February      +0.06      0.04          27.28      17.31          34.70      22.02
March         -1.63      1.03          27.95      17.73          34.28      21.75
April         -6.23      3.95          28.72      18.22          35.31      22.40
May           +7.02      4.45          28.62      18.16          35.89      22.77
June          -5.21      3.31          29.03      18.42          36.53      23.18
July          -5.27      3.34          28.35      17.99          35.89      22.77
August        -0.82      0.52          26.87      17.05          34.34      21.79
September     +4.78      3.03          24.78      15.72          32.94      20.90
October       +3.95      2.51          27.37      17.37          36.47      23.14



Considering first of all the absolute values we note that the average
errors with regard to sign are generally low. Thus the prediction from November
and from January production gives on the average between 2 and 3 eggs too many
for the year. For December, February, March and August the prediction
is in error by less than 2 eggs. The values predicted from April, May,
June, July, September and October records are from 4 to 7 eggs in error.

The average deviations without regard to sign are of course much larger 
since they constitute a measure of the error of prediction of the records of 
individual birds. They range from 24.8 to 30.1 eggs. The significance 
of errors of this magnitude will be more clearly brought out later. 

The square root of mean square deviation also shows considerable regularity
from month to month. These measures are naturally considerably
larger than the average deviation without regard to sign. They range
from 32.9 to 38.8 eggs.

It is clear that the annual egg production of birds similar in origin to the 
series upon which the prediction equations were based and maintained un- 
der similar conditions may be predicted with a relatively high degree of 
accuracy providing their record for any month is definitely known. 

The order of the errors will be more readily understood by expressing 
them in relation to the average production of the flock, as shown by the 
percentage deviations. 

We note that in predicting from December, February and August records 
the average error with regard to sign is less than one per cent of the average 
annual yield of the flock. In predicting from November, January and 
March the error lies between one and two per cent. When April, May, 
June, July, September and October records are used as a basis of predic- 
tion the average errors of prediction are from 2.50 to 4.50 per cent of the 
average annual yield. 

The average deviations without regard to sign are less than 20 per cent 
of the annual production. The values for the individual months range 
from 15.7 for September to 19.1 for January. 

The square root of mean square deviation is less than 25 per cent of 
the average annual production. The individual values range from 20.9 
for September to 24.6 for January. 
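
As a rough check on how the percentage columns relate to the actual ones, the mean annual production of the flock can be backed out of table 1. The paper does not state this mean; the figure below is inferred from the November row and is only as precise as the rounded table entries.

```latex
\[
\bar{E} \approx \frac{29.59}{0.1878} \approx 157.6
\qquad\text{and}\qquad
\bar{E} \approx \frac{38.65}{0.2452} \approx 157.6 .
\]
\[
\frac{2.39}{157.6} \approx 0.0152 = 1.52 \text{ per cent.}
\]
```

On this inferred mean of roughly 157.6 eggs, the signed error of 2.39 eggs for November corresponds to the 1.52 per cent entered in the table, so the two forms of each measure appear to differ only by this divisor.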

These two latter tests may at first seem to indicate very unsatisfactory
prediction. Such is not, however, the case. These give the average
errors either above or below the true record made in the prediction of the results
for an individual bird. The thing which is required in practice is generally
the prediction for a group of birds of a particular grade of egg record for the
month used as a base of prediction. In a flock of 415 birds this has been
shown to be possible with an error of less than 5 per cent of the annual production
when prediction is made from the record of any month of the year;
and with an error of less than 1 per cent when prediction is based upon the
records of a number of the individual months.

Lack of space precludes a discussion of the results of the prediction of 
the annual record of the bird from the combined record of two consecutive 
months. We may, however, illustrate the accuracy of prediction from the 
combined record of two consecutive months by means of the figures in dia- 
gram 1 which shows the accuracy of prediction from November plus Dec- 
ember and from April plus May in comparison with the results of predic- 
tion from November and April. In these the estimated production is shown 
by a straight line. 

[DIAGRAM 1: figure not reproduced; the horizontal axis runs from 2 to 50 in steps of 2.]

The actual production for the year for which prediction is made is shown
by solid dots for each group of birds as classified by monthly or bimonthly
record. The shaded areas are determined as follows. The birds were
first grouped into classes of five eggs range with respect to number of eggs
laid during the period of time used as a basis of prediction. The birds of
these classes of five eggs range were further subdivided into those in which
actual egg production was greater than the predicted and those in which
the actual number was less than the predicted number. 4 The average
error of prediction was determined for each of these groups, and these
averages represent the upper and lower limits of the shaded areas. The
upper limit represents, therefore, the average deviation (for the period for
which prediction is made) of all birds which make a higher record than that
predicted for their class. The lower limit of the shaded area marks the
average deviation for all birds which show an egg record lower than that
predicted. These diagrams, which are quite typical of the whole series,
certainly indicate excellent prediction.
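
The construction of the shaded limits can be sketched as below. This is a hedged illustration rather than the authors' procedure in detail: the function name and sample figures are hypothetical, the class boundaries (0-4, 5-9, and so on) are an assumption since the paper does not state where the five-egg classes begin, and birds whose actual record equals the prediction are simply left out of both groups.

```python
from collections import defaultdict


def shaded_limits(base_eggs, predicted, actual, class_width=5):
    """Average deviation above and below the prediction for each base-period class."""
    deviations = defaultdict(lambda: {"above": [], "below": []})
    for e, p, a in zip(base_eggs, predicted, actual):
        class_start = (e // class_width) * class_width  # assumed class boundaries
        if a > p:
            deviations[class_start]["above"].append(a - p)
        elif a < p:
            deviations[class_start]["below"].append(a - p)
    limits = {}
    for class_start, d in sorted(deviations.items()):
        upper = sum(d["above"]) / len(d["above"]) if d["above"] else None
        lower = sum(d["below"]) / len(d["below"]) if d["below"] else None
        limits[class_start] = (upper, lower)  # (upper limit, lower limit) in eggs
    return limits


# Hypothetical figures, invented for illustration (not data from the paper).
limits = shaded_limits(
    base_eggs=[1, 3, 4, 7, 8],
    predicted=[100, 110, 115, 130, 135],
    actual=[110, 100, 125, 120, 140],
)
print(limits)
```

Each class thus yields one positive and one negative average deviation, which would be plotted above and below the prediction line at that class.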

The results for the combined records of three consecutive months are 
shown in table 2. These show that greater accuracy of prediction 
may be obtained when the records of three months are used as a basis of 
prediction. Such a result is to be expected on a priori grounds. A care- 
ful comparison of the constants in tables 1 and 2 will show, however, that 
the improvement resulting from the trebling of the number of months used 
as a basis of prediction is not great. 

TABLE 2

Errors of Prediction of Annual Egg Production from the Record
of Three Consecutive Months

THREE MONTHS    AVERAGE DEVIATION        AVERAGE DEVIATION         SQUARE ROOT OF MEAN
USED AS BASE    WITH REGARD TO SIGN      WITHOUT REGARD TO SIGN    SQUARE DEVIATION
OF PREDICTION   Actual     Percentage    Actual     Percentage     Actual     Percentage
                deviation  deviation     deviation  deviation      deviation  deviation

Nov.-Jan.       +2.09      1.33          25.93      16.45          33.84      21.47
Dec.-Feb.       +0.78      0.49          25.31      16.06          32.65      20.72
Jan.-Mar.       +0.49      0.31          25.29      16.05          31.58      20.04
Feb.-Apr.       -4.07      2.58          24.16      15.33          29.77      18.89
Mar.-May        -0.73      0.46          25.42      16.13          31.14      19.76
Apr.-June       -2.31      1.47          24.33      15.44          30.59      19.41
May-July        -2.12      1.35          24.20      15.36          29.40      18.65
June-Aug.       -5.35      3.39          23.49      14.90          29.80      18.91
July-Sept.      -0.20      0.13          21.36      13.55          28.10      17.83
Aug.-Oct.       +3.91      2.48          21.59      13.70          29.23      18.55




Prediction of the number of eggs which will be laid in the period subse- 
quent to the month or group of months used as a basis of prediction may 
also be made. The errors for such a series of predictions, in which each 
individual month of the year (with the exception of the final month) has 
served as a basis for the prediction of the egg production of the remaining 
months of the year, are shown in table 3. The constants in this table 
show that when the period for which prediction is made is a long one a de- 
gree of accuracy fairly comparable with that for the whole year is attain- 
able. The absolute values of the average deviation without regard to 
sign and of the square root of mean square deviation necessarily become 
smaller as the period for which prediction is made becomes shorter. The 
relative (percentage) error, however, increases. Thus the accuracy of pre- 
diction decreases rapidly as the period for which prediction is made be- 
comes shorter. 






TABLE 3

Errors of Prediction of the Record of a Period of Months from the
Record of Individual Preceding Months

PERIOD FOR    MONTH USED    AVERAGE DEVIATION        AVERAGE DEVIATION         SQ. ROOT OF MEAN
WHICH         AS BASE       WITH REGARD TO SIGN      WITHOUT REGARD TO SIGN    SQ. DEVIATION
PREDICTION    OF            Actual     Percentage    Actual     Percentage     Actual     Percentage
IS MADE       PREDICTION    deviation  deviation     deviation  deviation      deviation  deviation

Dec.-Oct.     November      +2.39      1.57          29.59      19.49          38.65      25.46
Jan.-Oct.     December      +0.24      0.16          28.43      19.53          36.63      25.16
Feb.-Oct.     January       +2.37      1.71          26.48      19.06          34.61      24.91
Mar.-Oct.     February      +0.09      0.77          24.07      18.66          30.86      23.92
Apr.-Oct.     March         -0.24      0.21          22.81      20.36          28.40      25.34
May-Oct.      April         -4.00      4.23          21.36      22.59          26.50      28.02
June-Oct.     May           +3.62      4.98          19.89      27.35          24.81      34.11
July-Oct.     June          -2.86      5.34          17.62      32.91          21.44      40.04
Aug.-Oct.     July          -3.71      10.43         13.91      39.09          17.14      48.17
Sept.-Oct.    August        -2.56      13.57         9.76       51.75          12.12      64.26
October       September     -0.45      7.67          4.57       77.85          5.71       97.27
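
The opposite movement of the absolute and the relative errors in table 3 can be traced to the shrinking mean production of the period predicted. The sketch below backs that mean out of four rows of the table; the means are inferred from the rounded entries and are not stated in the paper, and the names used are hypothetical.

```python
# (period predicted, base month, average deviation without regard to sign in eggs,
#  the same deviation as a percentage), copied from four rows of table 3.
TABLE_3_ROWS = [
    ("Dec.-Oct.", "November", 29.59, 19.49),
    ("May-Oct.", "April", 21.36, 22.59),
    ("Aug.-Oct.", "July", 13.91, 39.09),
    ("October", "September", 4.57, 77.85),
]


def implied_mean(actual_deviation, percentage_deviation):
    """Mean production of the predicted period implied by the two table columns."""
    return actual_deviation / (percentage_deviation / 100.0)


implied = [implied_mean(dev, pct) for _, _, dev, pct in TABLE_3_ROWS]
for (period, base, dev, pct), m in zip(TABLE_3_ROWS, implied):
    print(f"{period:10s} from {base:10s} error {dev:6.2f} eggs, "
          f"{pct:6.2f} per cent, implied mean {m:7.2f} eggs")
```

As the period shortens the divisor falls faster than the error in eggs, which is why an error of under 5 eggs for October alone still amounts to more than three-quarters of the mean production of that month.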

The results of this investigation, taken as a whole, show that in the case 
of a flock of White Leghorn fowl which is essentially identical in genetic 
composition and maintained under essentially uniform conditions from 
year to year it is quite possible to estimate annual egg production from the 
record of either a single month or of two or three consecutive months with 
a high degree of accuracy. The same is presumably true of other breeds 
as well. This point is now under investigation. 

It is not possible to use the equations given in this paper for flocks differ- 
ing greatly in genetic composition or in conditions of maintenance from 
that upon which these equations were based. The problem of the deter- 
mination of corrective terms to be used when the equations are applied to 
flocks other than that upon which they are based is now under investiga- 
tion. 

A detailed account of these investigations is now in press in Genetics. 

1 Alder and Egbert, Bull. Utah Agr. Exp. Sta., No. 162, 1918.

2 Harris, Blakeslee and Warner, These Proceedings, 3, 1917 (337-341); Harris,
Blakeslee, Warner and Kirkpatrick, Genetics, 2, 1917 (36-77).

3 Harris, Blakeslee and Kirkpatrick, These Proceedings, 3, 1917 (565-569);
Genetics, 3, 1918 (27-72).

4 A range of five eggs was used in order to obtain a number of birds sufficiently large 
to reduce somewhat the irregularities due to the errors of random sampling. The 
errors of prediction were in each case determined for classes of unit range. Grouping 
is used for graphic representation merely. The average deviations represented by the 
limit of the shaded zone are to be thought of as measured from a line perpendicular 
to the ordinates and intersecting the prediction line on the mid-ordinate of the 5-egg 
class. 



